Human Variation and Lexical Choice

Authors

  • Ehud Reiter
  • Somayajulu Sripada
Abstract

Much natural language processing research implicitly assumes that word meanings are fixed in a language community, but in fact there is good evidence that different people probably associate slightly different meanings with words. We summarize some evidence for this claim from the literature and from an ongoing research project, and discuss its implications for natural language generation, especially for lexical choice, that is, choosing appropriate words for a generated text.

Similar resources

A Hybrid Machine Translation System Based on a Monotone Decoder

In this paper, a hybrid Machine Translation (MT) system is proposed by combining the result of a rule-based machine translation (RBMT) system with a statistical approach. The RBMT uses a set of linguistic rules for translation, which leads to better translation results in terms of word ordering and syntactic structure. On the other hand, SMT works better in lexical choice. Therefore, in our sys...

Discovering Demographic Language Variation

We propose a Bayesian generative model of how demographic social factors influence lexical choice. We apply the method to a corpus of geo-tagged Twitter messages originating from mobile phones, cross-referenced against U.S. Census demographic data. Our method discovers communities jointly defined by linguistic and demographic properties.

A Mixture Model of Demographic Lexical Variation

We propose a Bayesian generative model of how demographic social factors influence lexical choice. We apply the method to a corpus of geo-tagged Twitter messages originating from mobile phones, cross-referenced against U.S. Census demographic data. Our method discovers communities jointly defined by linguistic and demographic properties.

Nasal Coarticulation in Lexical Perception: The Role of Neighborhood-conditioned Variation

Nasal coarticulation has been shown to vary systematically in words depending on the number of phonological neighbors: words with many neighbors are produced with a greater degree of vowel nasality than words with fewer phonological neighbors [9]. This study examines the effect of this systematic low-level variation on lexical perception. The degree of nasality in natural real and nonsense word...

Prominence Mismatches and Differential Object Marking in Bantu

The majority of Bantu languages encode subjects by head-marking and objects by positional licensing. This reflects a point in the historical process whereby positional licensing of objects becomes obligatory due to the loss of inflectional morphology. What we observe in synchronic grammar is considerable variation both across and within languages in the use of head-marking morphology for objects. ...

Journal:
  • Computational Linguistics

Volume 28

Publication date: 2002